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Abstract 

, In this paper, we consider a three node, two-way relay system with digital network coding. The aim is to 

■ minimize total energy consumption while ensuring queue stability at all nodes, for a given pair of random packet 

(N ■ 

arrival rates. Specifically, we allow for a set of transmission modes and solve for the optimal fraction of resources 



o 



allocated to each mode. First we formulate and solve the static-channel problem, where all link gains are constant 
over the duration of transmission. Then we solve the fading-channel problem, where link gains are random. We call 
■ the latter the ergodic energy efficiency problem, and show that its solution has a water-filling structure. Finally, we 

provide a detailed analysis of the queues at each node using a random scheduling method that closely approximates 
the theoretical design, through a two-dimensional Markov chain model. 
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I. Introduction 

in_ 

^ . Network coding Q has emerged as a viable means to improve throughput in complex networks. 
O. 

^ Messages at the packet level are linearly combined at intermediate nodes and forwarded to multiple 
intended destinations. Using knowledge of the manner in which messages were combined, communicated 
^ . through some additional overhead bits, as well as knowledge of the message it contributed, a destination can 
successfully reconstruct the message intended for it. In this way, network throughput is greatly improved. 

The two-way relay network [HI exemplifies the use of network coding in digital communication. This 
simple network comprises two source nodes (^i and S2) and one relay node {R), where 5*1 and ^2 have 
information to exchange with each other. A direct link between Si and S2 is unavailable. This model 
applies for instance to communication between a base station and a mobile user in cellular communications, 
in which the mobile user is in a location shadowed from the base station and coverage is provided in 
the area by means of a relay. Another typical application is that of satellite communication, where two 
satellites have messages for each other and can only communicate through a ground station. 

Parts of this work were presented at IEEE ICCC (International Conf. on Communications in China) 2012, and submitted to IEEE ICCS 
(International Conf. on Communication Systems) 2012. The authors are with the Department of Electrical and Computer Engineering, 
National University of Singapore, Singapore, 117583. Tel: +65 6601 2055. Fax: +65 6779 1103. Emails: {elecz, eletj, motani} @nus.edu.sg. 
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With the aid of network coding, we can exchange two messages in two time slots with physical 
network coding |l6l, ifTOl . [[TT]| or three slots with digital network coding H, [|5l, compared with the four 
slots needed with pure forwarding. With network coding, the relay receives messages from both source 
nodes, then combines these messages and broadcasts the network-coded packet to 5*1 and 5*2. The rate 
region for DNC-based two-way relay networks was first explored in Q with various transmission 
modes including one-way forwarding and network-coded broadcasting. The former also investigated queue 
stability in the case of random data arrivals at each source node. The seminal work on physical layer 
network coding (PNC) presented in [61 showed that two slots is sufficient for two packet transmissions 
provided that receiver-side SNR is sufficiently high and perfect synchronization could be achieved. [|71 
and |[8l discussed the practical design of asynchronous PNC systems. Multiuser detection was applied at 
the relay node to exploit asynchronous transmission in the uplink to form a network coded packet in the 
downlink. 

In addition to throughput, resource allocation was investigated in BUl, lfT2ll . lfT3ll . [fTSll . In BU, optimal 
resource allocation with data fairness was discussed. Optimal resource allocation for analog network 
coding in a MIMO two-way relay network was investigated in [lOJ. In [[T2| and lfT3l . optimal resource 
allocation for an ANC-based system was considered for OFDM systems. In [15], resource allocation was 
investigated under the scenario of asymmetric multi-way relay communication over orthogonal channels. 

In [[T4|. the fading nature of a wireless channel was taken into consideration and the optimal position 
for the relay node was investigated. In [fT6ll . outage regions of DNC and ANC strategies were derived. It 
found that with the presence of a direct link, DNC was more promising than ANC in most cases in terms 
of outage performance. 

In this paper, we formulate, simplify and solve the problem of allocation of channel resources to the 
various transmission modes in a DNC-based two-way relay network to minimize total energy usage in 
transmission. The constraint in the optimization problem is that queue stability is maintained at all four 
queues in the network for a given pair of average packet arrival rates Ai and A2 at 5*1 and S2, respectively. 
The scheduler, e.g., the relay node, is assumed to have full knowledge of the instantaneous channel 
coefficients in the four flat-fading links. 

There are two scenarios to consider in such an energy minimization problem: (i) static channels , which 
remain unchanged over the entire duration of the transmission; and (ii) fading channels, which change 
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with time so that over the duration of the transmission, all channel states are visited. For static channels, 
queue stability requires that throughput from 5*1 (5*2) is at least Ai (A2) for a given set of channel gains; 
for fading channels, queue stability is guaranteed as long as throughput from 5*1 averaged over the fading 
distribution is at least Ai. In the fading case, the resulting optimization problem is solved by a water-filling 
procedure over the state space of the fading gains. This is tied to the solution of the static channel problem, 
which we will show to be equivalent to a convex optimization problem and hence is easily solved. 

Assuming ergodicity in the channel processes, the fading-channel solution minimizes the long-term 
average energy used to deliver the arrival rate pair (Ai,A2). The derivation of this result is a natural 
complement to the first static-channel energy optimization problem. 

Subsequently, we also design and simulate a random scheduling protocol based on the two designed 
resource allocation policies. To maintain finite queue length at each node, we introduce a back-off 
parameter e, so that the designed arrival rate is Aj(l -I- e) for an actual arrival rate of Aj. This leads 
to the total energy usage being smaller than the objective function used in our design optimization. We 
investigate the behavior of the queues at the source nodes and the relay, for the random scheduling 
algorithm. Finding the average delay and queue length at Si and 5*2 is straightforward, however the 
analysis of the two queues at the relay (for data from 5"! and S2 respectively) required the use of a 
two-dimensional Markov chain. We present and verify through simulations this analysis in the second 
part of this paper. 

The rest of this paper is organized as follows. In Section II, we introduce the energy usage optimization 
problem over static channels and provide the solution to this problem. In Section III, we introduce the 
ergodic energy usage optimization problem and present the associated solution with the water-filling 
structure. In Section IV, we present an energy efficient random scheduling protocol and provide the 
queueing analysis for this protocol. Simulation results are presented in Section V and Section VI concludes 
this work. 

Notation: Scalars are denoted by lower-case letters and bold-face lower-case letters are used for vectors 
and bold-face upper-case letters for matrices. In addition, we use I and to denote the identity matrix 
and the all-unity matrix respectively. 1 is the row vector with all elements equal to unity. 




Fig. 1. System model for a two-way relay network with random packet arrivals. 



II. Static Channel Problem 



In this section, we discuss the static channel problem, where the channel gains gir, etc. depicted in 
Figure [U are deterministic, modelling the block-fading case, or simply the case of fixed channels as seen 
in wired systems. 

A. System Model and Problem Formulation 

Figure [H depicts the two-way relay network of interest. Assume that packets arrive at Si according to a 
Poisson process at an average arrival rate of A,, ? G {1, 2}, and we seek to maximize the energy efficiency 
of this system for a given average arrival rate pair (Ai, A2) while maintaining queue stability, by adjusting 
the fraction of time allocated to each of the following five transmission modes: 

• Mode 1: Si transmits to R at rate Ri (uplink phase). 

• Mode 2: S2 transmits to R at rate R2 (uplink phase). 

• Mode 3: R broadcasts to Si and 5*2 at broadcast rate R3 (Network coding in downlink phase). 

• Mode 4: R transmits only to Si at rate (one-way forwarding in downlink phase). 

• Mode 5: R transmits only to 5*2 at rate -R5 (one-way forwarding in downlink phase). 

The four channel (power) gains gi^, g2r, gri and (7^2, correspond to modes 1,2, A and 5 respectiveljH . 
For convenience, we define gi = gir, g2 = g2r, 93 = "^Mgn, 9r2), 94 = 9ri and g^ = g^2 so that gi is the 
channel gain in Mode i. Receiver noise for each mode is modeled by i.i.d. Gaussian random variables 
with zero mean and unit variance. The power transmitted in Mode i is denoted Pj. In other words, the 
signal to noise ratio (SNR) at the receiver in Mode i is 



SNR, = Pigi. 



To achieve a rate of Ri in Mode i therefore requires that Pi be exponentially relateco to Ri, via 

2«« _ 1 



Pi{Ri 



9i 



1,...,5. 



(1) 



(2) 



'Note that whether Qri = Qir is inconsequential in this work. 

^An SNR gap term F can be inserted into the expression but this only changes the effective channel gain to Qi/T and does not affect the 
following discourse. 



It should be noted that in Mode 3, the rate of -R3 is the rate transmitted to each of Si and 5*2, due to the 
use of network coding. 

Suppose that a fraction /j, z = 1, . . . , 5, of each time slot is allocated to Mode i respectively. Therefore 
fiPi{Ri) is proportional to the energy used for transmission in Mode i, if we disregard the possibility that 
queues may be empty. To minimize the total energy consumption per time slot while satisfying queue 

stability constraints therefore requires the solution of optimization problem PI: 
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min y^miR,) (3) 

fi,Ri — 7 

subject to the queue stability constraints 

Ai < minifiRiJsR-i + fM (4) 

A2 < mmif2R2, fsR-s + URi) (5) 

and the physical constraints Yl^=i /« ^ 1 ^^'^ ft ^ 0- the next section, we show that PI can be 
transformed into a simpler convex optimization problem with only /j as the design parameters and hence 
can be solved easily. 

Finally, the queue lengths (in terms of packets) at Si and 5*2 at epoch t are denoted by Qi{t) and Q2{t)- 
We also define Qri{t) and Qr2{^) as the lengths of the two queues maintained at the relay node for Si 
and 5*2, respectively. 

B. Problem Solution 

Problem PI may be simplified through the following lemmas. 
Lemma 1: Under the optimal solution to PI, we have 

/4 = 0, if Ai > A2 

/s = 0, if Ai < A2 ^""^ 
and /4 = /5 = if Ai = A2. 

Proof: Denote the optimal values of /j and Rt by /* and R*. Constraints (31) and (|5]) dictate that 

f;R; + f;R; > \, (7) 
flRl + flRl > A2 (8) 

Assume that f^Rl + flRl - A2 = e > 0. Then reducing fl by e/R^ while keeping all other /* and all 
R* fixed results in constraint ([5]) still being satisfied, with a lower total energy. Therefore we can assume 
that f*Rl + flRl = \2. Similarly, + f*Rl = Ai. 



Suppose that > 0. If we reduce by 64, constraint ^ requires that /g be increased by 64RI/RI. 
This in turn means that can be reduced by S^Rl/ R^. In other words, queue stability is maintained 
under the following adjustment: 

/s ^ f; + 64Rl/Rl 

n ^ n-5,Ri/Ri 

The total energy used is now reduced by 

= 6, (^^p; + p: - §^p;^ (9) 

where P* is shorthand for Pi{R*). Note that R^, R\ and R^ need only satisfy two equations equivalent 
to 

flRl-flRl = A1-A2 (10) 

We can thus always find a point in the solution space of (flOl) to make the term in brackets in ^ positive. 
So as long as ^4 > does not lead to a violation of (flOl) . we can reduce total energy by AE > 0. If 
Ai > A2, this result and (fTOl) show that we should have /4 = 0, and /gi^g = Ai - A2. 

Similarly, if Ai < A2, then = and /4*i?4 = A2 - Ai; and if Ai = A2, then = = 0. ■ 
A corollary of Lemma [T] is that, since either one or both of fl and /g must be 0, /g cannot be zero, 
i.e. the broadcast phase (Mode 3) must be used. This is intuitive because one bit transmitted in Mode 3 
is equivalent to two bits delivered, and hence we should devote as much resources as possible to Mode 
3. The following lemma links Ri to /j. 
Lemma 2: The solution to PI satisfies 

m = J, ^ = 1,2 (11) 

minfAiiA?) 

R*3 = — (12) 

J3 

Rl = max(^^^y^,0^ (13) 



Rt = max(^i-^,0) (14) 



Proof: By reducing fi and/or total energy is reduced, and so the energy-minimization problem 
must be solved when /i-Ri is at its smallest value, i.e. flR[ = \i. Similarly, /2-R2 = ^2- 

Equations (fT3l) and (fT4l) arise directly from the proof of Lemma [TJ Equation (fT2l) comes from noting 
that if Ai < A2, Lemma [U states that /g = and hence f^Rt, = \i while if Ai > A2, /3-R3 = A2. ■ 

Lemma |2] implies that {Ri} can be removed as optimization variables, leaving only {fi} in an equivalent 
optimization problem. Lemma \T\ (and also Lemma O implies that when Ai > A2, Mode 4 (forwarding 
from the relay to 5*1) is not necessary, and that when Ai < A2, Mode 5 is not necessary. Hence there is 
no loss of generality in assuming that Ai < A2 from this point on, so that Mode 5 is no longer discussed 
and the min and max functions need not be invoked in Lemma [2l We now have 

R; = ^, i?: = and i?* = 0. (15) 

By substituting the new expressions for {R^} into PI, assuming Ai < A2, we get the equivalent 
optimization problem P2: 

+/4P4 ( (16) 



s.t. /. > (17) 
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$^/^<l (18) 

i=l 

where the power functions Pj(-) are given by 

It is not difficult to show that the cost function in P2 is convex in {fi}, and clearly the constraint set 
is also convex. Therefore the original problem PI has been turned into an equivalent convex optimization 
problem P2. 

It should be noted too that the cost function in P2 is monotonically decreasing in {fi} and hence its 
solution must lie on the boundary of the constraint set. Since fi = for all i is clearly not a viable 
solution, it must be that J2i fi = ^ i-^-' that all available time resources are fully utilized. This is easily 
understandable since increasing any fi leads to a corresponding reduction in the associated R^, which 
means an exponential decrease in required power and hence we would want fi to be as large as possible. 



Since P2 is a constrained convex optimization problem, its solution is found by solving the the Karush- 

Kuhn-Tucker (KKT) equations, which are easily derived as 

277(1-^) . 

+13* = (19) 



9i 9i 



A. 



2 



271(1-^) . 
^2 — + f3* = (20) 

92 92 
2t(l_Aito2) . 

^2 — + = (21) 

93 93 



2^^*(1 _ (^2-Ai)ln2 



r — ) 1 

+/3* = (22) 



94 9a 
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= (23) 



i=l 

where 13* is the Lagrange multiplier. Since (fT9l)-(l22l) are transcendental equations, it is infeasible to obtain 
explicit solutions in general and numerical methods are instead employed to obtain the solution to problem 
P2. 

C. Discussion and Insights 

Firstly, we give a lemma that links each active mode (i.e., those for which /* > 0) to the optimal 
solution to P2. 

Lemma 3: Under the optimal solution to P2, we have for each active mode that 

?^i<l^^IM + /!--i = 0. (24) 

9i 9i 

Note that this result immediately follows from equations (fT9l)- (|22l) and therefore the proof is omitted. 

Based on Lemma [H some observations can be made and are presented in the following proposition. 

Proposition 1: The optimal transmit rates and optimal transmit powers in respect of channel gains for 
active modes (i.e., /*, f* > 0) are related as follows: 

R*>R* and P* < P* ^ g, > (25) 
R*=R* and P* = P* ^ g^ = g, (26) 

where i,j = 1, 2, 3, 4, 5. In other words, R* — R* has the same sign as gi — gj whereas P* — P* has the 
opposite sign. 



Proof: Let us multiply by Qi in (|24l) . we then have the following equality for each active mode, 

2^^ (1 - i?* In 2) + [5*gi -1 = 0. (27) 
Note that R* is an implicit function of gi. Taking the derivatives on both sides in (ITTI) with respect to 



Qi we obtain 



-2^*(Rnn2)2^+/3* = 0, (28) 



from which it follows that 

dRl (3* 



> 0. (29) 



2«:(/?*ln2)2 

The inequality comes from the fact that R* for active modes and (3* are positive values. Hence Ri is a 
monotonically increasing function of gi. 

Recalling that R* = log2(l + -P/fifj), (|27l) can also be written as 

{P*g, + 1)(1 - ln(l + P:g,)) + /3*g, -1 = 0. (30) 



Taking derivatives with respect to g^ on both sides of (|30l) we arrive at 



dP* 

- ln(l + P*g,){P: + g.-^) + = 0. (31) 

dgi 



(32) 
(33) 



With some manipulations we can obtain (l32l)- (|34l) below. 

dP* - P*\n{l + P*gi 



dgi gi\n{l + P*gi) 

\n{l + P:gi)-P:g, 



g^^Hl + P:gi) 

< (34) 

where (|33l) comes directly from (l30l) and (|34l) comes from the fact that ln(l + x) < x if x > 0. Hence 
P* is a strict decreasing function of gi. 

Considering both (l29l) and (|34l) . Proposition [H is proved. ■ 
It is interesting that Proposition [H implies that the relative values of R* and P* depend only on the 
relative values of the corresponding channel gains, but not at all on the average arrival rates. The optimal 
time fraction for each mode, however, does relate to the corresponding arrival rate in the form f* = 

Note also that Proposition [H which relates the optimal power, rate and link gain in each mode, is 
useful for initializing the power and rate optimization routine so that the iterative procedure converges 
more quickly. For instance, if gi > g2, we should initialize P2 to exceed Pi, and Ri to exceed R2. 
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In addition, for the case that Ai and A2 are very small (A, <S 1), we can use a Taylor series to approximate 
the exponential functions and obtain the closed form power allocation solution for each mode, with f3 
obtained through a one-dimensional bisection search: 

1 

1 \ 2 

(35) 
(36) 
(37) 



/* = Ai 1 + 

91 

/* = A2(l + /3*-- 

92 



/* = Ai 1 + 



1 

93 



f* = - AO (^1 + /3* - 

/: = i-/r-/2-/3- 



1 \ ^ 
94 



(38) 
(39) 



We should also emphasize that when Ai > A2, the last two terms in the cost function of P2 change to 

and the problem can be solved with the appropriate substitutions. Hence there is no loss in generality in 
the assumption that Ai < A2 that we used for the majority of this section. 

III. Maximizing Ergodic Energy Efficiency in Fading Channels 

Unlike in the previous section, assume now that Qi, i — 1, . . . , 5, are random variables with known 
density functions p{gi). Assuming ergodicity in the random processes gi{t), where t represents time, 
minimization of average total transmit energy in the long run is formulated as P3: 



subject to 



mm 



(40) 



In the above 



Ai < min(/i^i,/3^3 + /5^5) 
A2 < mm{f2R2j3R3 + URa) 



i=l 



Pi = E[Pi(Ri(gi))] = I Pi(Ri(gi))p(gi)dgi, 

'0 



(41) 
(42) 
(43) 
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and 



Ri = E[Ri{gi)] = / Ri{gi)p{gi)dgi 
Jo 



where Pj and Ri are averaged over the channel gain distributions. We term problem P3 an ergodic energy 
efficiency maximization problem. Note that the minimization is over Ri{gi)) once the functional form 
of Ri{gi) is found, we obtain Pi{Ri{gi)) from ^ and hence Pi, while Ri is obtained through averaging 
Ri{gi) over gi. 

In other words, solving problem P3 yields a set of time fractions {/*} that minimizes an upper bound 
on the long-term average energy used, while guaranteeing long-term queue stability, assuming that the 
scheduler has knowledge of the instantaneous channel gains gfj, i = 1, . . . , 5, as well as the distribution of 
gi. At the same time, we obtain a rate allocation R*i{gi) dependent on the instantaneous channel gains gi. 

It can be verified that Lemmas 1-2 still hold for this optimization problem and hence it can be translated 
into another equivalent optimization problem, as shown presently. 

Again assuming without loss of generality that Ai < A2, P3 reduces to P4: 



mm 

fi,Ri{gi) 



J2m (44) 



i=l 



subject to 

Ai = hRi = hRz (45) 

A2 = /2i?2 (46) 

A2-A1 = /4P4 (47) 
4 

< 1. (48) 

i=l 

The Lagrangian function corresponding to P4 is 

44 / 4 \ 

F = -Y,W^R^ - A.) +7 5^/. - 1 (49) 

i=l i=l \i=l / 

where we used the notation A3 = Ai and A4 = A2 — Ai to denote virtual arrival rates for Mode 3 and 
Mode 4 respectively. 
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Fig. 2. Water filling diagram for each link. 



Since P4 is a convex optimization problem, its solution is given by the KKT conditions: 



2^* 

/;— In 2 -/;/?; 



0, 1 < i < 4 

0, 1 < i < 4 

0, 1 < i < 4 

A*, 7* > 0, l<i<4 

E/* = 1 

i=l 

where the asterisks denote optimal values. 

It can be found from (|52l) that ^ = (3* logg e and recall the power-rate equality in Q we have 

1 



(50) 
(51) 
(52) 
(53) 
(54) 



9i 



Then by definition, 



R* 



\og,il + P*iR*)gM9^dg^ 



13* log2 e 



log2( A*fl'i log2e)]9(5fi)c^5'i 



(55) 



(56) 



(57) 



Therefore the power allocation solution over each link is of the form of water-filling and the illustrative 
diagram is shown in Fig. [21 

Note that P* can also be found in terms of /3* and 7*. The optimal /* is given by (ISTI) . Finally, we 
obtain (3* and 7* from (l50l) by performing a multidimensional bisection search method to make /3*Ri — Pi 
equal (to 7*) and positive for all i. 



13 

It is interesting to note that similar observations to Proposition \T\ can be made for the ergodic energy 
optimization problem for Rayleigh fading channels. They are given in the following proposition. The 
detailed proof however is left in the appendix. 

Proposition 2: Under the optimal solution to ergodic energy consumption optimization problem over 
Rayleigh fading channels, the relation of the optimal transmit power and rate of each mode with the 
associated average link gain can be given by, 

P* < P* and R* > R* ^ g,> (58) 
P* = P* and R* = R* ^ g, = g^ (59) 

where i,j = 1, 2, 3, 4, 5 if /*, f* > 0. In other words, P* is a decreasing function of (ji whereas R* is an 
increasing function of gi in the ergodic case as well. 

IV. Scheduling Protocol: Queuing and Actual Energy Efficiency 

We now consider scheduling strategies which make use of the optimal time-sharing fractions {/j} (and 
hence constant rates {Ri} for PI and varying rate for P3) for each transmission mode when solving energy 
minimization problems PI and P3 respectively. We then describe below an intuitive protocol called the 
energy-efficient random scheduling protocol (EERSP). 

1) At the start of a packet interval of duration T, the relay randomly chooses a transmission mode 
with probability P[Mode i] = 

2) If Mode 1 is selected, the relay will inform Si to transmit. If Mode 2 is selected, the relay will 
inform 5*2 to transmit. The selected source will transmit if its data buffer is non-empty. 

3) If Mode 3 is selected, the relay will transmit. If both queues at R are non-empty (i.e., mm{Qri{-), Qr2i)) > 
0), it will broadcast network coded packets to both sources. If only Qri{-) > (or only (5r2(') > 0), 

the relay will forward data to Si (or S2). If both queues at R are empty, it will remain silent. 

4) If Mode 4 is selected, the relay will forward packets to Si if Qri(') > 0. If Mode 5 is selected, the 
relay will forward packets to ^2 if Qr2(') > 0. 

A. Queue Analysis for PI 

We now analyze the queuing performance of EERSP. The data arrival processes at both 5*1 and ^2 
are assumed to be Poisson. For brevity, we shall only focus on the queue length Qi{t) and Qr2{t). The 
analysis for Q2{t) and Qri{t) is very similar to that for Qi{t) and Qr2{t) and is omitted for brevity. 
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Here we use a two-dimensional Markov chain to model the backlog states of {Qi{t), Qr2{t)) and define 
q{m,k),{i,j) as the one-step transition probability of the event that Qr2{t + l) = j and Qi{t + 1) = i given that 
Qr2{t) — k and Qi{t) — m. There are four classes of queue transitions arising from the five transmission 
modes: 

Class I: 5*1 is selected for transmission in Mode 1; 
Class II: R is selected for transmission in Mode 3; 
Class in: Mode 2 or Mode 4 is selected; 
Class IV: R is selected for transmission in Mode 5. 

Without loss of generality, assume that one time slot of duration T corresponds to one channel use, 
and that capacity is measured in units of packets. Hence, for Mode 1 transmission, where a rate R\ is 
supported, we assume that a maximum of R{ packets can be transmitted from 5*1 to R. In general, in 
Mode i, a maximum of R* packets can be transmitted. The queue analysis below follows naturally from 
this view. 

For Class I up to R\, packets may be received reliably at R from Si, the conditional one-step transition 
probability is 

{Oj, m = 0,j = k 

ai, <m< Rl,j ^k + m (60) 

ai^rn+Ri , m> RIJ ^ k + Rl 

where 

a^. = lM2!exp(-AiT) (61) 

is the probability that j packets arrive at within the current slot. For the the case where Qi{t) = 0, no 
packets are queued for transmission by ^i, therefore Qr2{t + 1) = Qr2{t). For Qi{t + 1) to be equal to 
i, there must have been i arrivals at Si in the duration T of the {t + l)-st slot. 

For the case where < Qi{t) < i?*, all queued packets at 5"! will be transmitted during slot t + 1 and 
hence Qr2{t + 1) = Qr2{t) + Qi{t). Therefore, Qi{t + 1) is determined by the number of packets arrived 
during slot t + 1. 

For the case that Qi{t) > R^, we have Qr2{t -I- 1) Qr2{t) + Ri since can transmit at most R\ 
packets during one slot. Qi{t + 1) is thus determined both by the number of residual buffed packets 
Qi{t) — R\ and the number of new packet arrivals, i.e, Qi{t — Qi{t) — R^+i provided that there 
are i arrivals during slot t + 1. It should be noted that there are no other possible transitions. 
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Fig. 3. Two-dimensional Markov chain corresponding to EERSP protocol 



Under Class II, R will transmit network coded data to 5*2 in Mode 3. In this scenario, queue length 
increment at is determined by the number of arrived packets at within this slot. For queueing at 
R for 5*2, there will be two cases after transmission. If Qr2{t) is less than R^, it will be emptied at slot 
t + 1. Otherwise, there will be still Qr2{t) — R^ packets buffered at R for 52. The conditional transition 
probability thus can be given as by 



Q(m,k),(i,j) 



O'i-rri} J — 0, < -R3 
O'i-rri} 3 — k — -R3, k > -R3 



(62) 



Under Class III, Qr2 remains the same in the next slot. Only the external arrival at 5*1 will change the 
state of Qi{t). The conditional transition probability thus is 



Q{m,k),(i,j) Oji—mi j k 



(63) 



The analysis for Class IV is similar to that for Class 11 and is omitted for brevity. 

Combining all these modes and current queue state, the state space and corresponding transition 
probability for Qi{t) and Qr2{t) of the EERSP protocol are given below and is shown in Fig. [3l 

To simplify the analysis, the one-step transition probability matrix of the two-dimensional Markov 
chain can be written as that of a one-dimensional Markov chain, i.e., P = {qmn+k.in+j} where n is a large 
positive number and q(m,k),{i,j) is represented by the element of Prnn+k,in+j- The stationary distribution 
thus is 

7r = l-(I-P + 0)-^ (64) 
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The average queue length at and that at R for ^2 then are 

oo n 

Ql = ^^^T^in+j (65) 

oo n 

Qr2 = ^^JTT^n+j (66) 

B. Queue and Delay Analysis for P3 

The analysis of queueing for the fading channel problem is similar to the analysis of the static channel 
problem in the above subsection. The four classes of state transitions still apply, but the rates supported 
in each mode are no longer constant. They are instead random variables due to (|55] ). which makes R* 
a function of gi, a random variable. Therefore, we define c„ as the probability that Si is only able to 
transmit n packets in one slot if it is selected for transmission, while r„ and g„ are the probability that R 
can transmit n packets in one slot if it is selected for transmission in Mode 3 and Mode 5 respectively. 

For Class I, packets may be received reliably at R from Si, the conditional one-step transition probability 
then is 

a,, m = 0, j = k 

qim,k),{i,j) = { ai- E^=m Cn, < m,j = k + m (67) 
aj_^+„c„, 0<n<m,j = k + n 



Where Oj is as defined in the previous subsection. The first term on the right hand side of (1671) represents 
the case that Qi{t) is zero. As no packets are queued at 5*1 for transmission, we have Qr2it + 1) = Qr2{t) 
and Qi{t+1) is determined by the number of packets arrival in the duration T of the t+lst slot. The second 
term accounts for the case that the instantaneous transmit rate R*i{t) of is higher than Qi{t) and hence 
Qr2{t + 1) = Qr2{t) + Qi{t)- The probability that R\{t) >= Qi{t) is given by Yl^Q^{t) ^i- In this case, 
Ql is also determined by the number of new arrivals during slot t+1 since packets buffered previously 
are totally delivered to R. For the third term where R*i{t) < Qi(t), we have Q2{t + 1) = Q2(t) + Rl(t) 
since only as most Rl{t) packets can be transmitted during one slot. Qi(t+1) is hence given by the sum of 
the number of residual buffered packets and the number of new arrivals, i.e., Qi{t + 1) = Qi{t) — Rl{t) + i 
given there are i arrivals during slot t + 1 . 

For Class II, R will transmit network coded data to 5*2 in Mode 3. The conditional transition probability 
is given as follows. 

Qim,kU,j) \ a,.^r^, j = k-n,0<n<k ^^^^ 
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Where queue length increment at 5*1 is determined by the number of arrived packets at within this slot. 
For queueing at R for 5*2, Qr2{t+1) is determined by current buffering length Qr2{t) and the instantaneous 
broadcasting rate of R, i.e., R^{t). If Qr2{t) < R^it), it will be emptied at slot t + 1. Otherwise, there 
will be still Qr2{t) — Rsit) packets buffered at R for 5*2. 

Similarly, we can get the one-step transition probability for Class HI and Class IV. Combining them 
together, the one-step transition probability of the EERSP protocol with ergodic energy optimization can 
be given as follows. 



In a similar manner, average queue length of 5*1 and that of R for 5*2 can be characterized by using 

C. Practical Energy Consumption with Positive e 

To ensure reasonable running times and memory requirements in simulation, we design for average 
arrival rates that are slightly larger than the actual rates, i.e., — t- (1 + e)Aj {i = 1,2), where e is a 
small positive value. In this case it should be noted that the actual consumed energy might be lower 
than the designed energy consumption with positive e since there might be idle slots of each active mode 
in implementation. Therefore, it is interesting to investigate actual energy consumption in the regime of 
positive e and this is what will be done in this section. Note that here we only focus on the constant link 
gain scenario. The practical energy consumption under the scenario of fading channel can be derived in 
a similar manner and is hence omitted. 

Let /* be the optimal time fraction of each mode for virtual arrival rate pair (Ai(l + e), A2(l + e)). Let 
P* be the optimal power level and R* the optimal transmit rate of Mode i for virtual arrival rate pair. Let 
Tcjj be the stationary distribution for queue pair (<5i(t), Qr2(i)) and nfj be the stationary distribution of 
queue pair {Q2{t), Qriit)) respectively for actual arrival rate pair. Considering the two exclusive scenarios 
that Qi{t) < R* and that Qi{t) > R* together, we can derive the actual energy consumption for each 
active mode respectively. 



Q(m,k),(i,j) 



= < 




Clj, 

(1 ~ /l + flCojO'i-rm 

{fsrn + hq 



m = k = j = 

m>0,k=j = 

j = k — n,k>0,0<n<k 

j = 0, > 

j = k > 0,171 = 

j = k > 0,m > 

m>0,j = k + n,0<n<'m — 1 

< m, j = k + m 

else 



(69) 
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For Modes 1 and 2, the actual energy consumed can be given by, 

oo oo ^=-^'-1 oo . O*^ — 1 \ 

E E^^r^ ^ = i'2 (70) 

k=R* j=o j=o ^ y« / 

where the first term accounts for the case that current queue length is higher than transmit rate of this 

mode whereas the second term accounts for Qi{t) < R*. 

Similarly, for Modes 4 or 5 whichever is active (/* > 0), the actual energy consumed is given by, 

oo oo k=R*-l oo ^ Ofc _ 1 \ 

^; = E E -ti'Pt + E E ■ ' = 4(5) (71) 

fc=i?* j=o j=o ^ / 

For Mode 3, the actual energy consumed is given by, 

/ • 2 

max{l,j)>R* max(i,j)<=i?* ^ 

Combining them together with optimal time fraction assigned for each mode, actual energy consumed 
per slot for this entire network, Ead, is given by 

5 

K,t = Y,f:E[. (73) 

i=l 

We shall compare this actual energy consumption with the simulation result in the following section. 

V. Simulation 

We now present simulations to verify our findings and explore tradeoffs. To ensure reasonable running 
times and memory requirements, we design for average arrival rates that are slightly larger than the actual 
rates, i.e., Aj — )■ (1 + e)Aj (z = 1, 2), where e is a small positive value. Noise at each node is assumed to 
be Gaussian with zero mean and unit variance and Rayleigh fading models are used for each link. 

In Fig. m the data arrival rate at 5*2 is A2 = 1. An iterative sequential quadratic program (SQP) is 
used to solve PI directly, and P2 is solved through a numerical solution of the KKT equations. It can be 
observed that the solution to PI and the solution to P2 matches for different channel gain realizations and 
varying Ai. The equivalence of problems PI and P2 is therefore verified. In a similar manner, we have 
also numerically verified that P3 and P4 are equivalent, but omit the results in the interest of conciseness. 

In Fig. |5J the energy efficiency of short term digital network coding and the conventional scheme 
(without network coding) are compared for some specified channel realizations. The energy required for 
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supporting data arrival rate with conventional transmission is obtained by solving the problem below. 




s.t. h > (75) 

5 

$^/i = l (76) 

1=1 

h = (77) 

It can be observed that energy efficiency is greatly improved by employing network coding. For instance, 
in the case of unit channel gains, energy is reduced from 31 units to 15 units, at Ai = 1.5. 

In Fig. |6l the energy consumption of the ergodic digital network coding solution P3, static-channel 
digital network coding solution PI and the ergodic conventional non-NC solution are compared, where 
the conventional non-NC solution is obtained by solving P3 with /s = 0. The energy consumption of the 
static-channel DNC solution was obtained by averaging over 200 channel realizations. 

It can be observed that energy efficiency is greatly improved by employing network coding. For instance, 
in the case of unit average channel gain, energy usage is reduced from over 50 units to less than 30 units, 
at Ai = 1.5. It is also observed that with network coding, optimizing over long term varying link gain can 
save even more energy, since it mitigates the adverse effect of deep fading scenario compared with short 
term DNC solution. Under the scenario of unit average channel gains, total energy consumed is reduced 
from over 70 units to less than 30 units, at Ai = 1.5. 

We study the actual energy consumption by implementing EERSP protocol for the static-channel DNC 
in Fig.|71 In simulation, we ran 10^ slots and channels of each link are kept static throughout the simulation. 
It can be seen that the analytical result of actual energy usage matches well with the simulation result. 
As expected, the actual energy usage is less than the upper bound determined by the optimal energy 
minimization solution because the backoff parameter e was non-zero. 

In the following, we show the tradeoff between energy efficiency and queue length. The data arrival 
process at each source node is assumed to be Poisson. We also set Ai = 0.5 frame/slot and A2 = 1 
frame/slot and each slot spans 1ms. The bandwidth is set to be IkHz. All link gains are assumed to be 
unity and Gaussian noise at each node is assumed to have zero mean and unit variance. 

In Fig. [8] and Fig. |9l average queue length of the EERSP protocol for PI and for P3 are shown 
respectively. It can be seen that the analytical result and simulation for queue length at both Si and at 
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2 1 ~ 1 

0.5 1 1.5 

Fig. 4. Optimal total energy consumption for some specified channel realizations for constant channel gains. A2 is set to be constant 

R for 5*2 match with each other perfectly. It is also observed that with increasing e, the queue length at 
Si decreases quickly due to higher transmission rate for each mode. However, increasing e results in an 
increased probability of idle slots as we are over-provisioning the system, and therefore lower bandwidth 
efficiency. The factor e thus controls the trade-off between bandwidth efficiency and buffering delay. 

VI. Conclusion 

In this work, the minimal-energy allocation of resources to the five transmission modes in a two- 
way relay network with digital network coding at the relay and stochastic packet arrivals at the two 
source nodes was obtained. Both static and fading channels were accounted for, in the static-channel and 
ergodic energy minimization problems respectively. The proposed solutions ensure stability of all four 
queues in the network for arbitrary average packet arrival rates. The ergodic energy minimization solution 
has a water-filling structure and lower energy usage compared to re-designing the system to match the 
instantaneous channel gains using the static-channel energy minimization solution in a fading channel. In 
addition, a practical scheduling protocol was introduced to implement the proposed resource allocation 
solutions, and an exact queuing analysis of the protocol was obtained. As two-way relay networks appear 
in many applications including cellular networks and satellite systems, the work presented here is an 
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Fig. 5. Optimal total energy consumption comparison of conventional scheme and digital network coding scheme for constant channel 
gains. A2 is set to be constant. 
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0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 



Fig. 6. Optimal total energy consumption comparison for EERSP protocol. A2 is set to be constant. Case I denotes the case that g^^ = 
92r = Sri = 9r2 = 1- Casc II denotcs the case that g^^ = g^^ = l and g^^ = g^i = 2. 



important step towards the realization of a practical and useful communication network setup. 
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Fig. 7. Actual total energy consumption comparison for EERSP protocol with positive e. A2 is set to be constant. The scenario of 
gir = Sri = i,g2r = Qri = 2 and £ = 0.5 is investigated. 
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Fig. 8. Average Queue Length at Si and R for ^2 versus varying e for PO. All link gains are assumed to be unity. 
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Appendix A 
Proof for Proposition [2] 

In this appendix, we give the proof for Proposition [2l Firstly, let us recall that the probability density 
function (pdf) of link gain over a Rayleigh fading channel is given by 

p(^7,) = -exp(-^). (78) 

gi Qi 

Hence, substituting (l78l) into (l57l) . we can derive R* as follows. 

/OQ \ Q 

\og^{Plg^ log2 e)-eM--)d9^ (79) 

= -log2(Q^,)exp(-^)|,°?+ / ^-exp(-^)rf^, (80) 

c ■ 

■exp{- — )dgi (81) 



gi In 2 gi' 

exp( — ^)dgi (82) 



'i 5'iln2 c*5(i 
Where c* = /3* logg e for short. In the same manner, P* is given by, 

°° 1 ^. 

^exp(-— )% (83) 

°° c* a- 

^exp{~^)dgi. (84) 

Substituting (f82l) and (f84l) into (l50l) . we obtain the following lemma. 

Lemma 4: Under the optimal energy solution P4 over Rayleigh fading channels, optimal power level 

and optimal rate of each active mode can be linked by 

°° c* r* a- 

- ^) eM-^)dg^ + 7* = 0. (85) 

1 g' gi c*gi 



Note that c* is an implicit function of gi. Take derivatives with respect to gi on both sides of (I85I) . we 
obtain 

dc; jrij!-i)'>M-^)d9. dfec;) r(j--i)<=xp(-j|-)''«- „ 

dgi c* dgi c*gf 

Since ^ — — < and — — 1 < hold if a > 1, it can be deduced that ^ and ^^f^ are with different 

dc* diffiC*) n 

Signs, I.e., -rj- ^y-.'' < 0. 

=> ' 'dgi dgi 

Hence there are only two feasible scenarios: 1) ^ > and '^^f^."' ^ < 0; 2) ^ < and > 0. 

J ' dgi dgi ' ^ dg^ dg^ 

However, it can be immediately deduced that, if ^ > 0, ^^^^ = c* + gi^ > given c*,5'j > 0. A 



25 



contradiction occurs for scenario 1) and it cannot be feasible. Therefore the latter scenario is uniquely 
feasible, i.e., ^ < and > 0. 

Based on the above analysis, we seek to find the relation of P* and R* with (ji by deriving the first-order 
derivative of P* and R* with respect to gi for each active mode. 
From (f82l) . the derivative of R* with respect to (ji can be given by, 

dR* d{g,c*) JTih^M-^^dg, 



^2 > (87) 

where (|87l) comes from the conclusion that > 0. Therefore, R* is an increasing function of cji for 



dgi dgi {c*gi) 



each active mode in the ergodic case as well. 

In a similar way, the derivative of P* in (|84|) with respect to gi is derived as follows. 



dPl ^ dc*P* ^ djg.ci) I^j-^M-:SJ^)dg^ ^^^^ 

dgi dgi c* dgi {clhY 



dot r jf ^M-^Jdg^ /r(l - j) ^M-^Jdg. 
dg-} /r(l-^)exp(-^)ci^, 

dc* ir jj expj-^Jdgi exp(- - {J^ j- exp(- 

d9^ ^ ' /r(i-i)exp(-^)rf^. 



(89) 



(90) 



^ ^ c. 3i' 

< (91) 



where (1891) can be obtained by substituting the equality of (1861) into (|88|) and (|9TI) comes from Cauchy- 
Schwarz inequality and the fact that ^ < 0. 

Hence P* is an increasing function of gi for each active mode in the ergodic case as well. Proposition 
|2] then is proved. 
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